SRU-based Multi-angle Enhanced Network for Semantic Text Similarity Calculation of Big Data Language Model

نویسندگان

چکیده

As a fundamental problem of natural language processing (NLP), the calculation semantic text similarity plays crucial role in variety big data application situations. In process modeling, however, owing to complexity and ambiguity Chinese semantics, effectively capturing interaction characteristics only from single angle is impossible. This study proposes deep learning-based computational model for called SRU-based multi-angle enhanced network (SMAEN). Specifically, authors firstly combine character-grained embeddings word-granularity obtained pre-trained represent text. The encoded using bidirectional simple recurrent unit (Bi-SRU) network, local represented soft-aligned attention technique. addition, integrate Bi-SRU with an improved convolutional neural (CNN) global modeling capture semantic, time, spatial short interaction. Finally, they employ pooling layer aggregate results into fixed-length vector multi-layer perceptual (MLP) classifier make determination. Experimental on public datasets LCQMC PAWS-X show that proposed method fully captures features multiple angles achieves advanced performance. can produce better matching enhance accuracy large analysis. It applicable numerous scenarios involving data, such as information retrieval recommendation systems.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Tibetan-Chinese Cross Language Text Similarity Calculation Based on LDA Topic Model

Topic model building is the basis and the most critical module of cross-language topic detection and tracking. Topic model also can be applied to cross-language text similarity calculation. It can improve the efficiency and the speed of calculation by reducing the texts’ dimensionality. In this paper, we use the LDA model in cross-language text similarity computation to obtain Tibetan-Chinese c...

متن کامل

Learning Semantic Tags from Big Data for Clinical Text Representation

In clinical text mining, it is one of the biggest challenges to represent medical terminologies and n-gram terms in sparse medical reports using either supervised or unsupervised methods. Addressing this issue, we propose a novel method for word and n-gram representation at semantic level. We first represent each word by its distance with a set of reference features calculated by reference dist...

متن کامل

Similarity Calculation Method of Chinese Short Text Based on Semantic Feature Space

In order to improve the accuracy of short text similarity calculation, this paper presents the idea that use the history of short text messages to construct semantic feature space, then use the vector in semantic feature space to represent short text and do semantic extension, and finally calculate the short text similarity of corresponding vector in the semantic feature space. This method can ...

متن کامل

Calculation Method for Optimal Routing in Traffic Network Big Data

In allusion to the influence of traffic flow change on vehicle routing selection, the article aims at finding the method for planning the trip route under the condition of knowing traffic flow change. The method combining particle swarm algorithm and dynamic planning is used for routing optimization in order to obtain the excellent route for various vehicles influenced by traffic flow. The road...

متن کامل

JU_CSE_NLP: Multi-grade Classification of Semantic Similarity between Text Pairs

This article presents the experiments carried out at Jadavpur University as part of the participation in Semantic Textual Similarity (STS) of Task 6 @ Semantic Evaluation Exercises (SemEval-2012). Task-6 of SemEval2012 focused on semantic relations of text pair. Task-6 provides five different text pair files to compare different semantic relations and judge these relations through a similarity ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: International Journal of Information Technologies and Systems Approach

سال: 2023

ISSN: ['1935-570X', '1935-5718']

DOI: https://doi.org/10.4018/ijitsa.319039